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Virology Division, United States Army Medical Research Institute of 
Infectious Diseases, Fort Detrick, Frederick, Maryland 2 170 1-50 11. 

Three serologically distinct groups of hantaviruses have been associated v^ith 
severe, moderate, and mild forms of hemorrhagic fever' with renal syndrome • 
(HFRS). To gain a better understanding of the genetic variation among these 
viruses, we cloned and sequenced the M and the S genome segments of 
Sapporo rat virus, an etiologic agent of moderate HFRS, and compared the 
predicted gene products to those of Hantaan virus, and the Hallnas strain of 
Puumala virus, which are etiologic agents of severe and mild HFRS, 
respectively. The SR-1 1 S segment consisted of 1769 nucleotides and had an 
open reading frame (ORF) in the virus-complementary sense RNA with a 
coding capacity of 429 amino acids. Deduced amino acids from the SR-1 1 S 
segment ORF displayed 83% homology with those of Hantaan nucleocapsid 
(N) protein. Comparison of the S segment ORFs of all three viruses revealed 
58% homology. No evidence for additional nonstructural protein(s) encoded 
by the SR-1 1 S segment was obtained. The SR-1 1 M segment consisted of 
3651 nucleotides and had an ORF in the virus-complementary sense RNA 
with a coding capacity of 1 134 amino acids. Amino acid sequences predicted 
from the SR-1 1 M segment ORF were 75% homologous with those 
encoding Hantaan Gl and G2 envelope glycoproteins. Comparison of the 
deduced amino acid sequences of the M segment ORFs of SR-1 1, Hantaan, 
and Hallnas viruses revealed a 43% homology for amino acids constituting 
the Gl proteins and a 55% homology for amino acids constituting the G2 
proteins of the three viruses. The envelope proteins of SR-1 1 virus were 
localized within the M segment ORF by amino-terminal sequence analysis of 
purified Gl and G2. Gl initiated at amino acid 17 and G2 at amino acid 647 
within the ORF. Five potential asparagine-linked glycosylation sites were 
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identified in the SR-1 1 Gl coding sequences, four of which were conserved 
between Hantaan and SR-1 1 viruses and three of which were conserved 
among all three viruses. One potential glycosylation site was identified in the 
SR-1 1 G2 coding sequences and was conserved among Hantaan, SR-11 and 
Hallnas viruses. Cysteine residues were highly conserved within the M 
segment ORFs of all three viruses, suggesting a similar structure and function 
of the Gl and G2 proteins. 

PMID: 1970443 
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□ 1: M34882 Sapporo rat virus mRNA M segment, PubMed, Protein, Related Sequences, Taxonomy 
encoding an ORF, complete cds 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
MEDLINE 
FEATURES 

source 



CDS 



SRVAGSM 3651 bp SS-RNA VRL '03-AUG-1993 

Sapporo rat virus mRNA M segment, encoding an ORF, complete cds. 
M34882 

M34882.1 GI:335017 

Sapporo rat virus (strain SR-11) , cDNA to viral RNA, passed in Vero 
E6 cell line C1008. 
Sapporo rat virus 

Viruses; ssRNA negative-strand viruses; Bunyaviridae; Hantavirus. 
1 (bases 1 to 3651) 

Arikawa,J., Lapenotiere, H. F. , lacono-Connors, L , , Wang,M.G. and 
Schmal john, C. S . 

Coding properties of the S and the M genome segments of Sapporo rat 
virus: Comparison to other causative agents of hemorrhagic fever 
with renal syndrome 
Virology 176, 114-125 (1990) 
90232720 

Location/Qualifiers 
1..3651 

/organism="Sapporo rat virus" 
/db__xref="taxon: 11607" 
47. .3448 

/note="M segment ORF" 
/codon_start=l 
/protein id=^" AAA47825 . 1 " 
/db_xref="GI : 335018" 

/translation="MWSLLLLAALVGQGFALKNVFDMRIQLPHSVNFGETSVSGYTEF 
PPLSLQEAEQLVPESSCNMDNHQSLSTINKLTKVIWRKKANQESANQNSFEWESEVS 
FKGLCMLKHRMVEESYRNRRSVICYDLACNSTFCKPTVYMIVPIHACNMMKSCLIGLG 
PYRIQWYERTYCTTGILTEGKCFVPDKAWSALKRGMYAIASIETICFFIHQKGNTY 
KIVTAITSAMGSKCNNTDTKVQGYYICIIGGNSAPVYAPAGEDFRAMEVFSGIITSPH 
GEDHDLPGEEIATYQISGQIEAKIPHTVSSKNLKLTAFAGIPSYSSTSILAASEDGRF 
IFSPGLFPNLNQSVCDNNALPLIWRGLIDLTGYYEAVHPCNVFCVLSGPGASCEAFSE 
GGIFNITSPMCLVSKQNRFRAAEQQISFVCQRVDMDIIVYCNGQKKTILTKTLVIGQC 
lYTITSLFSLLPGVAHSIAIELCVPGFHGWATAALLITFCFGWVLIPACTLAILLVLK 
FFANILHTSNQENRFKAILRKIKEEFEKTKGSMVCEICKYECETLKELKAHNLSCVQG 
ECPYCFTHCEPTETAIQAHYKVCQATHRFREDLKKTVTPQNIGPGCYRTLNLFRYKSR 
CYILTMWTLLLIIESILWAASAAEIPLVPLWTDNAHGVGSVPMHTDLELDFSLPSSSK 
YTYKRHLTNPVNDQQSVSLHIEIESQGIGAAy 

QYPWHTAKCHFEKDYEYENSWACNPPDCPGVGTGCTACGLYLDQLKPVGTAFKIISVR 
YSRKVCVQFGEEHLCKTIDMNDCFVTRHAKICIIGTVSKFSQGDTLLFLGPMEGGGII 
FKHWCTSTCHFGDPGDVMGPKDKPFICPEFPGQFRKKCNFATTPVCEYDGNIISGYKK 
VLATIDSFQSFNTSNIHFTDERIEWRDPDGMLRDHINIVISKDIDFENLAENPCKVGL 
QAANIEGAWGSGVGFTLTCKVSLTECPTFLTSIKACDMAICYGAESVTLSRGQNTVKI 
TGKGGHSGSSFKCCHGKECSSTGLQA3APHLDKVNGISELENEKVYDDGAPECGITCW 
FKKSGEWVMGIINGNWWLIVLCVLLLFSLILLSILCPVRKHKKS" 
692 c 755 g 1085 t 



1119 a 



BASE COUNT 
ORIGIN 

1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct 
61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat 
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tcagttgccc 
cccactctca 
ccaccagtca 
tcaggaatca 
gttgtgtatg 
ctgttatgat 
tatacatgct 
ggttgtctat 
ccctgacaag 
gacaatctgc 
atcagcaatg 
tattattggt 
ggaggttttt 
agaaatcgca 
ctccaaaaac 
attggctgct 
tcagtcagtc 
gggatactat 
ttcatgtgag 
gtctaagcaa 
tgatatggat 
agttataggc 
ccattctatt 
tttgattaca 
agtccttaag 
cattctacgg 
ctgtaagtat 
aggagagtgc 
ttacaaagtt 
tcaaaatatt 
ttatattctg 
tgcagcagaa 
tcctatgcat 
caaaagacat 
tgaaagtcaa 
gaatctaaaa 
cactgcaaaa 
cccagattgc 
gaagccggta 
g'cagtttggt 
gcatgccaaa 
atttctgggg 
tcactttgga 
attcccaggg 
tggaaacatt 
taacacaagc 
gcttcgggat 
gaatccttgt 
cgggtttaca 
aaaggcctgt 
aaatactgtc 
tgggaaagaa 
tggtatctct 
tacttgttgg 
tgtcctaatt 
tcctgttaga 
attttcaaac 
catttttatt 
tttattgatt 



cactcagtca 
ttacaggagg 
ctctcaacaa 
gcaaaccaga 
ttaaagcata 
ctagcctgta 
tgcaacatga 
gaaaggacat 
gctgttgtca 
ttttttattc 
ggctccaaat 
ggaaactccg 
tctgggatta 
acgtaccaga 
ttaaaattga 
tcagaagatg 
tgtgacaaca 
gaggcagtcc 
gccttttcag 
aatagattta 
attatagtgt 
caatgtattt 
gctattgagt 
ttctgcttcg 
ttctttgcaa 
aaaataaagg 
gagtgtgaaa 
ccatattgct 
tgtcaagcca 
gggccaggct 
acaatgtgga 
atcccccttg 
acggatcttg 
ctcacaaacc 
ggcattggtg 
acctcatttc 
tgccattttg 
ccaggggttg 
ggaacagcct 
gaagaacacc 
atatgtataa 
cccatggaag 
gaccctggtg 
caatttagga 
atatcaggct 
aatatacact 
catattaata 
aaagtagggc 
ctcacatgca 
gacatggcaa 
aaaattaccg 
tgttcatcaa 
gagttagaaa 
tttaaaaaat 
gtcttgtgtg 
aagcataaaa 
actttatcat 
tttatatgga 
caccggggtg 



actttgggga 
cagaacagct 
taaataaatt 
attcatttga 
gaatggttga 
atagtacatt 
tgaaaagctg 
actgcactac 
gtgcattgaa 
atcagaaagg 
gtaataatac 
cccctgtata 
ttacatcacc 
tttcagggca 
ctgcttttgc 
gtcgtttcat 
atgcactccc 
acccttgcaa 
aaggaggtat 
gagcagctga 
actgtaatgg 
atactattac 
tgtgtgttcc 
gctgggtatt 
atatccttca 
aggagtttga 
cattaaagga 
ttacccactg 
cccaccgatt 
gttaccgaac 
ctcttcttct 
tccctctctg 
aattagactt 
cagttaatga 
ctgctgttca 
attgttatgg 
agaaagatta 
gtacaggttg 
ttaaaattat 
tttgtaaaac 
ttgggactgt 
gaggtggtat 
atgtcatggg 
aaaaatgtaa 
ataagaaagt 
tcactgatga 
ttgttatttc 
tccaggcagc 
aggtgtctct 
tttgttatgg 
ggaaaggtgg 
ctggcctcca 
acgagaaagt 
caggtgaatg 
tactgctgct 
aatcataaat 
ttagaactta 
ttaattacta 
ctgtcttgac 



aacaagtgtg 
agtgccagag 
aaccaaggtc 
agttgtggaa 
agaatcatat 
ctgtaaacca 
tttgattggc 
gggtatattg 
aagaggcatg 
gaatacatat 
agatactaaa 
tgcccctgct 
acatggagaa 
gatagaggca 
aggtattcca 
atttagtcct 
tttaatctgg 
tgtgttctgt 
tttcaatatt 
gcagcagatt 
tcagaaaaaa 
aagtctcttt 
agggtttcat 
gattcctgca 
tacaagcaat 
aaaaacaaag 
attgaaggca 
tgaaccgaca 
cagagaagat 
actaaatctt 
cattattgaa 
gacagataat 
ctctttgcca 
ccaacagagt 
tcatcttgga 
tgcctgcaca 
tgagtatgaa 
tactgcttgt 
aagtgtaaga 
aattgatatg 
atctaagttt 
aatctttaaa 
tccaaaagat 
ctttgccaca 
acttgcaaca 
gagaattgaa 
taaagatatt 
aaacatagaa 
cacagaatgc 
tgcagaaagt 
ccatagtggt 
agccagtgca 
ttatgatgac 
ggttatgggt 
cttttctctt 
cccacctaac 
acttggcact 
aaaaaaatac 
atctggcggc 



tcaggctata 
agctcatgca 
atatggcgga 
agtgaagtca 
agaaatagga 
actgtttata 
cttggcccct 
acagaaggaa 
tatgctatag 
aagatagtga 
gttcaaggat 
ggtgaagact 
gaccatgacc 
aaaatccctc 
tcatactcat 
ggtttatttc 
aggggcctaa 
gtcttatcag 
acttctccaa 
agctttgtct 
acaatcctaa 
tcactgttac 
ggctgggcca 
tgtacattag 
caagagaacc 
ggttccatgg 
cataacctat 
gaaactgcaa 
ttaaaaaaga 
tttaggtata 
tccatcctct 
gctcatggcg 
tccagttcta 
gtctcattgc 
cattggtatg 
aaatatcaat 
aatagctggg 
ggattatatc 
tacagtagaa 
aatgattgct 
tctcaaggtg 
cactggtgta 
aaaccattta 
actccagttt 
attgattctt 
tggagagacc 
gattttgaaa 
ggtgcctggg 
ccaacatttc 
gtgacactct 
tcttcattca 
ccacatctgg 
ggtgcacctg 
ataatcaatg 
atcctgttga 
aatcttcaca 
actatctgat 
tctcttctat 
gtctactact 



cagaatttcc 
acatggacaa 
aaaaagcaaa 
gctttaaagg 
gatcagtaat 
tgattgttcc 
acagaatcca 
aatgctttgt 
caagcataga 
ctgccattac 
attatatctg 
tcagagcaat 
tacccggcga 
atacagtgag 
caactagtat 
ctaacctaaa 
ttgatttaac 
gaccaggtgc 
tgtgtctggt 
gccaaagagt 
caaaaacatt 
caggggttgc 
cagctgcact 
ctattctttt 
gattcaaagc 
tttgtgagat 
catgtgttca 
ttcaggcaca 
ctgtaactcc 
aaagtaggtg 
gggcagcaag 
ttgggagtgt 
agtacacata 
atatagaaat 
atgcaagatt 
acccatggca 
cttgcaaccc 
tagatcaatt 
aagtgtgcgt 
ttgtgactag 
acactctact 
catctacctg 
tttgccctga 
gtgaatatga 
tccaatcatt 
ctgatggcat 
atttggctga 
gttcaggtgt 
ttacatcaat 
cacgaggaca 
aatgctgtca 
ataaggtaaa 
aatgtggcat 
ggaactgggt 
gcatcttgtg 
tcatgtatcg 
aactgacttt 
ctcccaatct 
a 
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Q 1: M34881 Sapporo rat virus mRNA S segment, PubMed, Protein, Related Sequences, Taxonomy 
encoding an ORF, complete cds 



LOCUS SRVAGSS 1769 bp ss-RNA VRL 03-AUG-1993 

DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds. 

ACCESSION M34881 

VERSION M34881.1 GI:335019 

KEYWORDS 

SOURCE Sapporo rat virus (strain SR-11) , cDNA to viral RNA, passed in Vero 

E6 cell line C1008. 
ORGANISM Sapporo rat virus 

Viruses; ssRNA negative-strand viruses; Bunyaviridae; Hantavirus. 
REFERENCE 1 (bases 1 to 1769) 

AUTHORS Arikawa,J., Lapenotiere, H . F . , lacono-Connors, L . , Wang,M.G. and 
Schmal j ohn, C. S . 

TITLE Coding properties of the S and the M genome segments of Sapporo rat 

virus: Comparison to other causative agents of hemorrhagic fever 
with renal syndrome 
JOURNAL Virology 176, 114-125 (1990) 
MEDLINE 90232720 
FEATURES Location/Qualifiers 
source 1 . . 1769 

/organism="Sapporo rat virus" 
/db_xref="taxon: 11607" 
CDS 43. .1332 

/note="S segment ORF" 
/ codon_start=l 
/protein id=" AAA47 826 . 1 " 
/db_xref="GI : 335020" 

/translation="MATMEEIQREISAHEGQLVXARQKVKDAEKQYEKDPDDLNKRAL 
HDRESVAASIQSKIDELKRQLADRLQQGRTSGQDRDPTGVEPGDHLKERSALSYGNTL 
DLNSLDIDEPTGQTADWLTIIVYLTSFWPIILKALYMLT.TRGRQTSKDNKGMRIRFK 
DDSSYEDVNGIRKPKHLYVSMPNAQSSMKAEEITPGRFRTAVCGLYPAQIK7\RNMVSP 
VMSWGFLALAKDWTSRIEEWLGAPCKFMAESLIAGSLSGNPVNRDYIRQRQGALAGM 
EPKEFQALRQHSKDAGCTLVEHIESPSSIWVFAGAPDRCPPTCLFVGGMAELGAFFSI 
LQDMRNTIMASKTVGTADEKLRKKSSFYQSYLRRTQSMGIQLDQRIIVMFMVAWGKEA 
VDN FHLGDDMD PE LRS LAQ I LI DQKVKE I SNQE PMKL " 

BASE COUNT 569 a 331 c 410 g 459 t 

ORIGIN 

1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa 
61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag 
121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat 
181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc 
241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca 
301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt 
361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg 
421 acatcattcg tggtcccgat catcttgaag gcactgtaca- tgttaacaac acgaggtagg 
481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag 
541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc 
601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat 
661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg 
721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc 
781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc 



NCBJ Sequence Viewer http://www.ncbi Jilm Jiih.gov:80/entrez/qu...Nucleotide&list_uids=3350 19&dopt=GenBan 

841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa 

901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg 

961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct 

1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa 

1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc 

1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt 

1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt 

1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct 

1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat 

1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa 

1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat 

1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag 

1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt 

1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa 

1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc 

1741 tttttgggga gtaagggagt ctactacta 

// 
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